Explicit Cache Management for Volume Ray-Casting on Parallel Architectures
نویسندگان
چکیده
A major challenge when designing general purpose graphics hardware is to allow efficient access to texture data. Although different rendering paradigms vary with respect to their data access patterns, there is no flexibility when it comes to data caching provided by the graphics architecture. In this paper we focus on volume ray-casting, and show the benefits of algorithm-aware data caching. Our Marching Caches method exploits inter-ray coherence and thus utilizes the memory layout of the highly parallel processors by allowing them to share data through a cache which marches along with the ray front. By exploiting Marching Caches we can apply higher-order reconstruction and enhancement filters to generate more accurate and enriched renderings with an improved rendering performance. We have tested our Marching Caches with seven different filters, e. g., Catmul-Rom, Bspline, ambient occlusion projection, and could show that a speed up of four times can be achieved compared to using the caching implicitly provided by the graphics hardware, and that the memory bandwidth to global memory can be reduced by orders of magnitude. Throughout the paper, we will introduce the Marching Cache concept, provide implementation details and discuss the performance and memory bandwidth impact when using different filters.
منابع مشابه
Distributed-Memory 3D Rendering with Object Migration
Object dataflow is a popular approach used in parallel rendering. The data representing the 3D scene is statically distributed among processors and objects are fetched and cached only on demand. Most previous methods were implemented on shared memory architectures and exploited only object-space coherency to reduce cache misses. In this paper, we propose an efficient model for object dataflow o...
متن کاملDynamic Grid Refinement for Fluid Simulations on Parallel Graphics Architectures
We present a physically-based fluid simulation with dynamic grid refinement on parallel SIMD graphics hardware. The irregular and dynamic structure of an adaptive grid requires sophisticated memory access patterns as well as a decomposition of the problem for parallel processing and the distribution of tasks to multiple threads. In this paper, we focus on the representation and management of th...
متن کاملLatency- and Hazard-Free Volume Memory Ar chitecture for Direct Volume Rendering
The computational power required for direct volume rendering like ray-casting or volume ray-tracing can be provided by high speed rendering architectures. However the increasing proces sor speed makes a performance bottleneck obvious the vol ume memory. This paper describes a volume memory architec ture that achieves at least a tenfold speed-up in read-out rate with moderate additional hard...
متن کاملThree Architectures for Volume Rendering
Volume rendering is a key technique in scientific visualization that lends itself to significant exploitable parallelism. The high computational demands of real-time volume rendering and continued technological advances in the area of VLSI give impetus to the development of special-purpose volume rendering architectures. This paper presents and characterizes three recently developed volume rend...
متن کاملMemory Management of Large Data Sets in Volume Visualization Systems
Significant improvements in hardware components result in the potential to process and render large volumes with complex features and interactions. This has challenged system developers to manage several gigabytes of memory at interactive rates. Ray casting a 1024 volume with 2 bytes/voxel in 30Hz requires memory bandwidth of 60GB/sec. Some of the volumetric system design issues are: time to ma...
متن کامل